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ABSTRACT 

We present a hierarchical Bayesian determination of the velocity-dispersion function of approxi- 
mately 430,000 massive luminous red galaxies(LRGs) observed at relatively low spectroscopic signal- 
to-noise ratio (SNR ~3-5 per 69kms~^) by the Baryon Oscillation Spectroscopic Survey (BOSS) of 
the Sloan Digital Sky Survey HI (SDSS-III). We marginalize over spectroscopic redshift errors, and use 
the full velocity-dispersion likelihood function for each galaxy to make a self-consistent determination 
of the velocity-dispersion distribution parameters as a function of absolute magnitude and redshift, 
correcting as well for the effects of broadband magnitude errors on our binning. Parameterizing the 
distribution at each point in the luminosity-redshift plane with a log-normal form, we detect signifi- 
cant evolution in the width of the distribution toward higher intrinsic scatter at higher redshifts. Using 
a subset of deep re-observations of BOSS galaxies, we demonstrate that our distribution-parameter 
estimates arc unbiased regardless of spectroscopic SNR. We also show through simulation that our 
method introduces no systematic parameter bias with redshift. We highlight the advantage of the 
hierarchical Bayesian method over frequentist "stacking" of spectra, and illustrate how our measured 
distribution parameters can be adopted as informative priors for velocity-dispersion measurements 
from individual noisy spectra. 

Subject headings: galaxies: evolution, kinematics and dynamics — methods: statistical — techniques: 
spectroscopic 



1. INTRODUCTION 



Massive elliptical galaxies (EGs: lHubblel[l936D are one 
of the most important classes of astrophysical objects for 
galaxy evolution and cosmology. They represent the end 
stage of hierar chical galaxy- formation processes (e.g., 
iKauffmann et a l. 1993; Baugh ct al. 1996), and therefore 
their properties and scaling relations represent a key test 
for theories of galaxy formation within a cosmological 
context. In addition, since they are the most luminous 
and highly clustered galaxies, they serve as ideal cosmo- 
logical tracers of clust ers and large-scale structure (e.g., 
lEisenstein et al.ll2005f ). 

To a first approximation, EGs are "pressure- 
supported" rather t han ro tationally support e d (e.g., 
Bertola fc Capacciolil 119751 : llUingworthl (l977l: iBinnevI 
1978[) . with their stellar motions characterized by a ve- 



locity dispersion a. Among the many observational pa- 
rameters of massive elliptical galaxies, a is unique in 
its direct sensitivity to the depth of the galaxy's grav- 
itational potential (and therefore to its mass), and in 
its relatively weak dependence on observational aper- 
ture. In combination with galaxy sizes (i.e., half-light 
radii), velocity dispersions can be used to determine 
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"dynamical masses" that a re independent of stellar- 
population assumptions (e.g.. iPadmanabhan et al]l2004l : 
IBolton et al . 2008b). Dynamical masses can then in turn 
be used to trace the evolution of EGs at fixed mass (e.g., 
van der Marel &: van DokkumI [2007t | van der Wei et aLl 



200a ICappellari et al.ll20'09r indicating a nuanced dy- 



namical history despite generally passive star-formation 
histo ries at z < 1 (e.g.. iThomas et al1l2005l : ICool et al.l 
I2008f ). Stellar velocity dispersion is also the most im- 
portant single predict or of strong gravi t ational lens- 
ing cro ss sections (e.g., [Turner et al.l 1198-5 IBolton et al.l 
I2008a[) . and can be used in combination with strong lens- 
ing observations to constrain the central mass-density 
struc ture of elliptica l galaxies at cosmological distances 
Ce.g..lKoopmans fc T reu 200l iTreu fc KoopmansI [200l 
iKoopmans et al.ll2006() . Stellar velocity dispersions are 
tied to nearly all other properties of EGs through 
multip le empirical scaling relations. iFaber &: Jacksonl 
(jl976[) found a correlation between luminosities of early- 
type galaxies and their velocity dispersions a known 
as the Faber- Jac kson Relation (FJR). The relation of 
iKormendvl (|1977t) ties the surface brightness (I)c with 
the effective radius Ro. Both the FJR and Kormendy re- 
lations can be v iewed as projections of the "fundamental 
plane" (FP, e.g., EI orgovski fc Davislll987l : lDressler et al.l 



plane (fr, e.g..lUiorgovski &: UavisllliJgi'tlUressler et al.l 
[l987t IBernardi et al.l l2003c| ) within the space spanned 



by logj^o-R-e, (I)e and logj^g Furthermore, central 
black hole mass has been found to be correlated with 
the velocity dispersion of the bulge v i a the Mbh — cr 
relationfe.g.. iFerrarese fc MerrittI 120001: iGebhardt et al.l 
l200nl : iKoTinendv fc Bended I2OO90 . Together, these rela- 
tions provide multiple constraints on the structure, for- 
mation, and evolution of EGs . 
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Although velocity dispersion plays a starring role in the 
study of EGs, it is an "expensive" observable that must 
be measured spectroscopically. Hence, large samples of 
galaxies with well-measured velocity dispersions across 
cosmic time are largely unavailable. Measurements of 
tj are made by quantifying the line-of-sight Doppler 
broadening of absorption lines relative to a set of tem- 
plate stellar spectra , either via the Fourier method (e.g., 
iSargent et a l.lll977l: Pibnrv fc Davij|1979D or the direct- 
fitting method (e.g.. IBurbidge et allll961l : IRix fc White! 
[1992 ') . Both methods rely on the quality of galaxy spec- 
tra: for spectra of low signal-to-noise ratio (SNR), un- 
certainties in the measured stellar velocity dispersion can 
be very large and significantly non-Gaussian. This as- 
pect is of particular concern for galaxies at cosmologi- 
cal distance (faint even if luminous), which can only be 
measured at high SNR through substantial investment of 
spectroscopic observing time and aperture. 

In this paper, we introduce a hierarchical Bayesian sta- 
tistical method to measure the parameters of the distri- 
bution of stellar velocity dispersions within a popula- 
tion of galaxies that has been observed with relatively 
low spectroscopic signal-to-noisc ratio. We apply the 
method to approximately 430,000 luminous red galaxy 
(LRG) targets f rom the Baryon Osci llation Spectroscopic 
Survey (BOSS: ISchlegel et all 120091) . one of four survey 
pro jects within the Sloan Digital Sky Survey III (SDSS- 
III: lEisenstein et al.ll201ll ). We quantify the evolution of 
the velocity-dispersion function of BOSS galaxies, and 
detect significant evolution in the intrinsic population 
RMS of logiQ (7 at fixed absolute magnitude since z « 0.8. 

This paper is organized as follows. In Section [51 we 
describe the sample selection and the method for ve- 
locity dispersion measurement. Section [31 presents our 
statistical method for the measurement of the distribu- 
tion of stellar velocity dispersions within a population 
of galaxies, including a verification using high- SNR re- 
observations of a sub-sample of galaxies and a test for 
redshift-dependent systematic biases. Section |4l presents 
the results of our application of this method to the BOSS 
sample, showing the evolution of the velocity-dispersion 
function at fixed magnitude. Discussion and conclusions 
are presented in Section [51 Throughout the paper, we 
assume a standard general-relativistic cosmology with 
= 0.3, Ha = 0.7 and Hq = 70 km s^^ Mpc^^.' 

2. SPECTROSCOPIC DATA 

2.1. Sample Selection 

We use spectroscopic data obtained by the BOSS 
project via the 2.5-m SDSS telescope locate d at Apache 
Point Observatory in Sunspot, New Mexico (|Gunn et al.l 
[20061) . The primary science goal of BOSS is the de- 
tection of the baryon acoustic feature in the two-point 
correlation function of galaxies (and quasar absorption 
systems), from which to constrain the distance-redshift 
relation and the nature of dark energy. BOSS also of- 
fers a unique resource for the study of the properties and 
evolution of massive galaxies. The BOSS footprint covers 
approximately 1 , deg^ in five imaging filters {ugriz, 
iFukugita et al.|[i996D . and will by 2014 obtain spectra of 
about 1.5 million LRGs out to redshift z ~ 0.8. Note that 
the majority of the BOSS LRG targets are massive EGs, 
although there is a small fraction of late-type galaxies as 



well as unres olved multiples, par ticularly at the higher 
redshift end ([Masters et al.ll201lD . 

The BOSS spec tra are broadly comparable to SDSS-I 
([York et al.ll2000( ) spectra in resolution (R w 2000), and 
cover a wavelength range from 3,600A to 10,000A. The 
primary design goal of BOSS is to measure as many red- 
shifts as efficiently as possible, in order to map the largest 
possible volume of the universe. Consequently, the SNR 
of the galaxy spectra is significantly lower than in SDSS- 
I, with typical SNR values of 3 to 5 per 69kms~^ (re- 
binned) pixel, as compared with > 10 per pixel in SDSS- 
I. Thus although the BOSS spectroscopic database is by 
far the largest available for the study of massive galaxies, 
the individual spectra are well below the SNR threshold 
of about 10 per A generally regarded as a minimum for 
acceptable velocity-dispersion measurement on a galaxy- 
by-galaxy basis. Motivated by this context, we develop 
the Bayesian analysis method presented below. 

Spectroscopic calibration, extraction, classification, 
and redshift measurement of all BOSS galaxy spectra 
are carried out usi ng the idlspec2d software (sec. e.g., 
lAihara et"aI1 1201 ID . written originally for SDSS-I and 
recently updated to handle the data format and noise 
regime of BOSS. In selecting our analysis sample, we 
make the following cuts based upon the redshift pipeline 
output: 

• We use only the best spectroscopic observation of 
any given galaxy target as some objects are ob- 
served more than once (SPECPRIMARY=1 according 
to SDSS terminology). 

• We use only objects that were both targeted as 
galaxies and spectroscopically confirmed as galax- 
ies 

• We require a confident redshift measurement with 
no warning flags (ZWARNING = according to SDSS 
terminology) [3 

These cuts return approximately 430,000 galaxies from 
the first 1.5 years of BOSS spectroscopic observations, 
with redshifts ranging from zero to 1, but concentrated 
primarily over the interval 0.2 < z < 0.8. 

For all selected galaxies, we use the measured spec- 
troscopic redshifts and SDSS broadband imaging col- 
ors to compute absolute rest-frame l^-band magni- 
tudes and associated uncertainties via the sdss2bessell 
routine implemen t ed in the kcorrect software of 
iBlanton fc RoweisI (|2007[ ). 

The details of the BOSS galaxy target selection, and 
the corresponding incompletenesses, are the subject of 
a separate paper (Padmanabhan et al., in preparation). 
Here we summarize the target selection cuts for the two 
main galaxy target classes that we focus upon in our 
current study. The first is the CMASS sample (for "con- 
stant mass"), which is selected photometrically to de- 
liver LRGs of approximately constant stellar mass over 
the redshift interval 0.3 < z < 0.8, and which consti- 
tutes approximately 76% of the galaxies selected above. 
The second sample, LOZ, is selected to target LRGs 
at lower redshifts. and represents the remaining 24% of 

^ For BOSS galaxies, the specifically relevant flag is 
ZWARNING_NOqSQ = (Bolton et al., in prep.) 
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Fig. 1. — Distribution of galaxies for our sample, along with his- 
tograms of redshift z and V-band absolute magnitudes Aly- LOZ 
galaxies (black) and CMASS galaxies (gray) are plotted separately. 
For both the samples, contours are drawn at constant number den- 
sity in the z-My plane, enclosing 50%, 90%, and 99% of the sam- 
ple. 

the selected galaxies. Defining the foUowi nR quantities 
(jEisenstein et al.ll2001t ICannon et al.ll2006[) : 

C|| = 0.7(g- r) 1.2[(r- - 0.18] (1) 

ci_ = (r-i)-(.g-r)/4.0-0.18 (2) 

d± = {r-t)-{g-r)/8.0 (3) 

ifiber2 — i-band fiber magnitude for 2" fibers, (4) 

the CMASS sample is defined by the photometric cuts: 

17.5 < i < 19.9 (5) 

r-i<2 (6) 

di_ > 0.55 (7) 

ifiber2 < 21.7 (8) 

i< 19.86 -Kl.60(di- 0.80) (9) 

as well as a cut to exclude galaxies with major-axis half- 
light radii greater than 8". Equations and ([7]) aim 
to select galaxies between redshifts z ~ 0.4-0.8, while 
Equation ([9]) attempts to impose a cut at constant stel- 
lar mass across this redshift range. The LOZ sample is 
defined by the cuts: 



r < 13.5 + C||/0.3 
|c_l| < 0.2 
16 < r < 19.6 . 



(10) 

(11) 
(12) 



Equation (|T0| sets up a magnitude threshold as a func- 
tion of redshift and Equation (fTTI) picks out low-rcdshift 
galaxies specifically. 

The redshift-absolutc magnitude distributions of these 
two BOSS galaxy samples, with associated ID projec- 
tions, are plotted in Figure [1] In the following analy- 
sis, we will treat the two populations separately, since 
the combined sample does not define a simple locus in 
luminosity-redshift space, with LOZ galaxies being of 
generally higher luminosity over the redshift range where 
the two samples overlap. 

2.2. Velocity Dispersion Extraction 



Our strategy for extracting velocity-dispersion infor- 
mation is to make use of the full velocity-dispersion 
likelihood function for each galaxy spectrum. To do 
this, we make use of the IDL routine vdispf it within 
the idlspec2d product of spectroscopic analysis soft- 
ware. This software has been extensively tested in the 
SDSS-I, and has been upgraded for the analysis of BOSS 
data. Velocity dispersions measured with this software 
have been the ba sis for multiple studie s of the dynam- 
ics of EGs (e. g.. iBernar di et al.' '2003a''b",'?; 'Shcth et alj 
|2003; Padman abhan et al . 2004; Koopmans et al. 200^ 
To summarize briefly: vdispf it uses a set of stellar 
eigenspectra derived from a principal-component anal- 
ysis (PGA) de composition of the ELODI E stellar spec- 
trum library (jPrugniel fc SoubiranI |2001[ ) . The eigen- 
spectra are convolved and binned to the resolution and 
sampling of the BOSS spectra, then broadened by Gaus- 
sian kernels of different trial velocity dispersions. The 
broadened templates are then shifted to the redshift of 
the galaxy under consideration. After masking out re- 
gions containing common emission lines, a linear least- 
squares fit is performed to obtain a best-fit model spec- 
trum at each trial velocity dispersion. The resulting 
curve of ^-s a function of trial velocity dispersion en- 
codes the likelihood function of velocity dispersion given 
the data. For measurements from high signal-to-noise 
spectra, the position of the minimum is adopted as 
the maximum-likelihood estimate of the galaxy's velocity 
dispersion. Below, rather than adopt these estimates, we 
will work with the full likelihood function. 

In this procedure, we must choose the number of stellar 
eigenspectra to use in forming the template basis. The 
pipeline analysis of SDSS-I data used the first 24 PGA 
modes. For the much lower signal-to-noise BOSS data, 
an acceptable can be obtained using only the first 
5 PGA modes, and hence we restrict our basis to this 
smaller number of eigenspectra so as to avoid fitting noise 
fluctuations. 

As described above, before being fit to the galaxy spec- 
tra, the stellar eigenspectra are shifted by the appropri- 
ate galaxy redshifts. If the redshifts have non-negligible 
errors, the corresponding offsets can introduce a bias into 
the measured velocity dispersion. Although the BOSS 
spectra provide redshifts with a precision well in ex- 
cess of what is required for large-scale structure stud- 
ies and absolute-magnitude determinations, their errors 
can be non-negligible on the scale of internal galaxy ve- 
locity dispersions. Therefore, we implement a marginal- 
ization over redshift errors in our analysis. Specifically, 
we modify the vdispf it routine to take a radial velocity- 
marginalization range Az (expressed in constant-velocity 
pixels) and the redshift error 6z (the ±68% confidence 
interval as estimated by the idlspec2d pipeline) as ar- 
guments. Then we calculate x^(o':-z) for a set of trial 
redshifts in the range z ± Az and define a new effective 
X^(o') by integrating over z as 



X^(cr) = -21n / dz exp 

\Jz-Az 



X^(o-,2) 



(13) 

We assume a Gaussian probability distribution for z 
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jiven by 



p{z) (X exp 



{z — 2;bost)^ 

2S^ 



(14) 



where Zbest is the best-estimate rcdshift from the BOSS 
spectroscopic pipeline. The choice of a Gaussian prior is 
made because the galaxy redshifts have been determined 
using absorption and emission-line information over the 
full optical range of the BOSS spectrograph, whereas the 
velocity-dispersion fitting is done only over the 4100- 
6800 A rest-frame range covered by the ELODIE spectra, 
while also masking the wavelength positions of common 
emission lines. We also explored the use of a flat prior 
to completely marginalize over redshift in the velocity 
dispersion analysis, and found only a negligible change 
(at most a few percent) in the derived relations. For 
most galaxies, the effect of this marginalization on the 
curve is insignificant, but since we wish to avoid in- 
troducing any spurious broadening into our population 
analysis, wc apply the procedure to all spectra. 

In this work, we do not make any aperture correc- 
tion for velocity dispersions, although the angular BOSS 
fiber radius of 1" subtends a different physical length 
scale as a function of redshift. Since aperture velocity 
dispersions are seen in the local universe to depend on 
aperture radius on ly to a weak power of approximately 
0.04 to 0.06 (e.g., | j0rgcnseneLalJ|1995|; IMehlert et atl 
120031: ICappellari et al. 200a^ . this effect should be rela- 
tively insignificant. For example, taking a redshift range 
spanning the majority of our CMASS sample, the angu- 
lar size of a fixed physical length at z = 0.8 is about 
72% of its angular size at z = 0.4. Assuming the veloc- 
ity dispersion within an aperture decreases as the aper- 
ture to the power —0.05 (a representative compromise 
value between the previous three references), this would 
correspond to a systematic change in measured velocity 
dispersion of about 1.7%, which is well below the level 
of other uncertainties in our analysis. In addition, the 
typical atmospheric seeing of approximately 1.8" deliv- 
ered to the BOSS spectroscopic focal plane will dilute 
the significance of the varying projected fiber scale. Es- 
sentially, BOSS velocity dispersions will represent a fair 
luminosity-weighted average value over the half-light ra- 
dius of most target galaxies, which have half-light radii 
on the order of 1". 

3. STATISTICAL POPULATION ANALYSIS FORMALISM 

The results of lBernardi et al.l ()2003bD suggest that the 
distribution of velocity dispersions for early-type galax- 
ies at fixed luminosity can be well approximated by a 
log-normal function. Motivated by this, we will assume 
a Gaussian distribution in logj^Q a with mean m and in- 
trinsic scatter s: 



P(logio s) 



1 



/27r.s 



exp 



10 ' 



2s2 



(15) 



Wc will treat m and s as functions of rcdshift and ab- 
solute magnitude, although we will suppress this de- 
pendence in our notation for convenience . Com pared 
to the SDSS-I studies by iBernardi et"all (|2003b[ ) and 
ISheth et all (l200l . we have a much larger sample with 
greater redshift coverage, so we may investigate the evo- 
lution of both the mean and intrinsic scatter of logj^Q a 
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Fig. 2. — Results for stacking of spectra within a single redshift- 
luminosity bin. The top two panels show typical individual spectra, 
while the bottom panel shows the high-SNR stacked spectrum for 
that bin, resulting from averaging the spectra of ~ 200 galaxies. 



with redshift and luminosity as e ncoded by these two 
distribution parameters (See also IBezanson et al.l 120111 
for a complementary analysis in terms of photometric 
velocity-dispersion proxies). Our strategy will be to an- 
alyze samples binned by an interval of 0.04 in redshift z, 
and by 0.1 in absolute magnitude My- 

3.1. Frequentist Approach 

As mentioned above, the SNR of BOSS galaxy spec- 
troscopy is typically rather low, especially at the high- 
redshift end of the survey. Therefore, point estimation of 
the velocity dispersion of individual galaxies is of ques- 
tionable reliability. Hence, we resort to analyzing the 
data by binning galaxies in the z-My plane, requiring 
at least 100 galaxies in every single bin. The most ob- 
vious first approach to determining the mean velocity 
dispersion in these bins is to remove the small relative 
redshift differences within the bin, stack all the spectra 
directly, and analyze the resulting high-SNR combination 
(see Figure ^ . Although we do not adopt this method 
for our ultimate determinations of m and s, it is instruc- 
tive to consider how such an approach relates to these 
parameters. 

While a velocity dispersion can be measured at high 
SNR from the stacked spectrum, the measured value 
bears a non-trivial relation to the parameters m and 
s, which we now derive. Assuming equal luminosities 
within the bin (which basically holds by construction 
due to binning in absolute magnitude), what we measure 
from the stack crgj.g^j,j, is the population-weighted expec- 
tation value of i.e. 



0-= 



tack 



'^'^ p{'^oeia<^\'m,s)dlogj_„a . (16) 



The variance of Cgtack given by 

Var(cTLck) = ^Var(cT2) = ^((a^) - {a^) (17) 

with N being the number of galaxies in the bin. 
Making use of the following relation, which can be de- 
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rived for our log- normal form of Equation p5|) : 



'p(logiocr|m,.s)dlo, 



. 2Q[«m+n^ ln(10)s^/2] 



we find that 



2 

stack 



-[^Q[2m+2 In(lO) s^] 



(18) 

(19) 
(20) 



Thus we see that the velocity dispersion measured from 
the stacked spectrum is not given by the mean log-a 
value alone, but rather includes a contribution from the 
width of the population distribution as well. In principle, 
if a good estimator of Var(crgjg^^]^) can be obtained, the 
system can be closed and solved for m and s indepen- 
dently. Although we attempted to estimate Var(o'g^j^^j^) 
via bootstrap resampling within each bin, we found the 
treatment of observational errors and varying signal-to- 
noise ratio among the spectra to be intractable within 
this framework. Rather than working further from mea- 
surements of stacked spectra, we proceed to the hierar- 
chical Bayesian method described in the following sec- 
tion. 

3.2. Hierarchical Bayesian Approach 

To constrain the distribution parameters m and s 
within each redshift-magnitude bin, we consider the fol- 
lowing expansion of the likelihood function C{m, s\{d}) 
in the bin: 



C{m,s\{d})=p{{d}\m,s) 



(21) 



= n / ?'KUogiof^)p(logio'^l"^,s)dlogio(T 

Here {d} is the set of all spectra in the bin, with each 

element di representing the spectrum of the z**^ galaxy. 

The expression p((ii I log^o cr) is related to the x^(logio cr) 
function by 

X?(logio ct)" 



p(di|logio cr) (X exp 



(22) 



and p(logj^g cr|TO, s) is given by Equation (jlSp . Translat- 
ing into Bayesian terms, we have a posterior probability 
for TO and s given by 



p(to, s|{(i}) cx p({d\\m, s)p{m, s) 



(23) 



with p{m, s) being the prior probability distribution for 
TO and s. For simplicity, we assume a uniform prior on to 
and s over a reasonable range. In actuality, we find that 
the likelihood is quite strongly peaked in each bin, so the 
exact nature and range of the prior are insignificant. 

3.3. Verification 

To verify the correct functioning of our Bayesian ap- 
proach, we make use of data from BOSS plate 3851. Due 
to a CCD failure on one of the two BOSS spectrographs 
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Fig. 3. — Engineered sub-sample Gaussian histogram in logj^g cr 
constructed using measurements from 7-hour BOSS observations 
(black), with histogram of same sub-sample using velocity disper- 
sions measured from 1.75-hour observations (gray). The two his- 
tograms have been given a slight relative horizontal offset, for dis- 
play purposes. The vertical dashed line indicates the mean log^Q a 
value of 2.33 for the sub-sample. Note the relative broadening of 
the 1.75-hour histogram due to the effects of observational error. 

that temporarily suspended normal survey operations, 
500 of the 1000 targets on this plate were plugged and 
observed for a total integration time of 7 hours (28 expo- 
sures of 15 minutes each) over the course of several nights 
ending on 2010 April 12, denoted within the SDSS-III 
database by the modified Julian date (MJD) of 55298. 
Subsequent to the replacement of the failed CCD, the en- 
tire plate was re-plugged and observed for a more typical 
BOSS integration time of 1.75 hours total on MJD 55302. 
The set of re-observed targets allows us to compare BOSS 
galaxy spectra of standard survey depth with spectra of 
the same objects at essentially double the nominal sur- 
vey SNR. We use these repeat observations to verify 
that our method (1) does not have a signal-to-noise ratio 
dependent bias in the estimation of velocity-dispersion 
distribution parameters, and (2) reproduces the known 
distribution of velocity dispersions within a controlled 
sample, as measured from the high-SNR set of spectra. 

Between the deep and shallow re-observations, there 
arc 308 galaxies which have equal redshifts (within Az — 
±0.005) and classifications for both observation dates. 
Since the sample is heterogeneous in magnitude and red- 
shift, we select a sub-sample with a controlled distribu- 
tion in velocity dispersion. We restrict our attention to 
galaxies that have their individual velocity dispersions 
measured at SNR of 10 or more from the 7-hour obser- 
vations, and that have redshifts between 0.4 and 0.6. We 
then select a random sub-sample of 125 galaxies from 
this set so as to have a Gaussian histogram in logj^Q cr 
with a mean of to = 2.33 and an intrinsic RMS scatter of 
s — 0.07. The histogram of this sub-sample, along with 
the histogram of the same sample as constructed from 
galaxy- by-galaxy measurements using the 1.75-hour ob- 
servations, is shown in Figure [31 

The frcquentist formulas given by Equations (jl9p and 
([20]) do not account for observational error, and hence we 
do not use them to solve for to and s estimates for our 
relatively low signal-to-noise BOSS survey data. How- 
ever, our subsample of high signal-to-noise 7-hour ob- 
servations allows us to test them, which we do before 
proceeding to the verification of our Bayesian analysis 
framework. First, we use Equation (jl9p with a mean of 
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Fig. 4. — Credible-region contours of constant posterior probabil- 
ity density for m and s parameters measured from the engineered 
test sub-sample of galaxies observed with both 7-hour integrations 
(black) and 1.75-hour integrations (gray). The symbol is the lo- 
cation of the parameters chosen for the construction of the test 
sub-sample. The offset in s between the contours and the symbol 
is a result of the proper deconvolution of observational uncertainty 
that is implemented by the Bayesian method. 

2.33 and an intrinsic scatter of 0.07 to predict a value 
of (Tstack = 219kms~^, which is in very good agreement 
with the result of (222 ± 12)kms~^ that we obtain by 
fitting the stacked spectrum of this set of 125 galaxies di- 
rectly. Similarly, we predict [Var((Tgj3^^]j)]^/^ = 40kms^^ 
from Equation (j20p . which is in reasonable agreement 
with the value of 46kms~^ obtained through a boot- 
strap resampling process. In both cases the agreement is 
not exact because there is still some observational error 
even in the 7-hour data, but as mentioned above, we will 
pass to the Bayesian framework to quantify these effects. 

We next carry out the estimation of the m and s pa- 
rameters of the selected sub-sample of objects, using the 
Bayesian approach described above, for both the 7-hour 
and the 1.75-hour data sets. FigureH]shows the resulting 
posterior probability density for these parameters as esti- 
mated from both data sets. As expected, we see that the 
posterior PDF is tighter for the 7-hour data. More im- 
portantly, we see no significant bias in the posterior PDF 
between the low-SNR and high-SNR data sets. This is 
especially significant for the estimation of the s parame- 
ter: if we were not handling our observational uncertain- 
ties correctly, we might expect to infer a broader intrin- 
sic distribution (higher s value) from the noisier data, 
but this not the case. We also see that the parameters 
used to engineer the subsample are recovered with no 
significant bias in m. We see a slight offset of the 7- 
hour maximum posterior s value from the input value 
used to engineer the sample. This is in the direction and 
of the size to expected given the observational error of 
the 7-hour individual-spectrum velocity dispersion mea- 
surements, which have an RMS signal-to-noise of about 
17. This corresponds to an observational broadening of 
about 0.025 dex in the engineered histogram of Figure [31 
which is deconvolved by the Bayesian parameter estima- 
tion procedure to give the lower recovered s value seen 
in Figure 21 

Another concern is that there might be a systematic 
bias with redshift, since the spectral regions used by 
vdispf it in fitting for velocity dispersions (rest frame 
wavelength range from 4,100A to 6,800A) move to the 



redder and noisier parts of the spectrum as the red- 
shift gets higher. In order to test this, we construct an- 
other controlled sub-sample with 152 galaxies of redshift 
z < 0.2 and very high SNR. Then we take the best-fit 
template combination models of those 152 galaxies re- 
turned by vdispf it and redshift them to progressively 
higher redshift bins, giving them a uniform random dis- 
tribution over a bin width of Az = 0.04 in each case (to 
match our actual binning). At each new redshift, the 
model spectra are added to sky-subtracted BOSS sky 
fibers to simulate realistic survey noise, and scaled in- 
dividually in fiux to give a typical median SNR at that 
redshift bin. We then analyse the simulated redshifted 
samples with our Bayesian method to estimate the pos- 
terior PDF of m and s. The results are shown in Fig- 
ure El and Figure IHl for 5 separate redshift bins. We see 
that the recovered parameters are consistent within ob- 
servational error across all redshifts, with no apparent 
redshift-dependent bias. 
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Fig. 5. — Contours of constant posterior probability density 
(68%, 95%, and 99%) for m and s parameters obtained from a 
controlled sub-sample of 152 galaxies in 5 different redshift bins 
with gradually reduced SNRs. 




Fig. 6. — The best estimated m (Diamond) and s (Triangle) val- 
ues for the controlled sub-sample of 152 galaxies at 5 different 
redshift bins. 

Finally, to rule out any significant dependence of our 
measurement on airmass and fiber position within the 
BOSS spectrographs, we make use of data from plates 
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3615, 3647, 4238 and 4239. These four plates cover 
roughly the same area of sky, but with different plate 
drillings that place the same objects in very different 
fibers within the spectrograph system. They were also 
observed over a range of different airmasses on multi- 
ple nights. From these plates, we construct several sub- 
samples of spectra, all of which include the same galaxies, 
but are drawn from different plates and/or observations. 
As with the previous tests, we recover consistent esti- 
mates of m and s from the analysis of all these samples. 

Based on the above three tests, we conclude that our 
method recovers accurate estimates of the population 
velocity-dispersion distribution parameters. 

3.4. Magnitude Error Correction 

Our method of determining p{di \ logj^g incorporates 
an explicit marginalization over redshift error, and prop- 
agates all observational uncertainty in the velocity dis- 
persion of a given galaxy. Our binning in redshift and ab- 
solute magnitude introduces additional error possibilities 
that we must account for. In the case of redshift, the er- 
rors are negligible relative to the bin width of Az = 0.04, 
and are unlikely to contribute any artificial broadening to 
our determination of the redshift dependence of m and 
s. The absolute magnitude errors are, however, non- 
negligible in comparison to the magnitude bin width of 
AMv — 0.1, and thus we use the following technique 
to estimate and compensate for the broadening effect of 
the observational scattering of galaxies between absolute- 
magnitude bins (see Figure [T]) . 

Suppose (m, s) are the true values within a bin, and 
(mi, si) are the values that we determine in the presence 
of absolute-magnitude errors. We assume that 

mi ~m + 5m (24) 

si = + 5s^ , (25) 

where 5m and 5s are the biases introduced by magni- 
tude errors. To estimate and remove these biases, we 
add additional random errors to all our galaxy absolute 
magnitudes My to give 



Mv + e<5Afv 



(26) 



where e is a normally distributed random number with 
mean and standard deviation 1, and 5 My are the 
galaxy-by-galaxy absolute-magnitude errors estimated 
by sdss2bessell (propagated from SDSS ugriz appar- 
ent magnitude errors). We repeat our analysis, binning 
instead in My, and denoting the new distribution pa- 
rameter results by m^ and S2- We assume these new 
determinations are related to mi and si in the same way 
as mi and si are related to m and s, which implies that 

m2 = m + 2 5m (27) 

(28) 



m + 2 5m 
s^ + 2 5s^ 



Thus the biases due to absolute magnitude errors 5m and 
5s can be removed to yield 



m — 2 mi — m2 



2sl 



(29) 
(30) 



4. RESULTS: EVOLUTION OF THE 
VELOCITY-DISPERSION FUNCTION 

In this section we present the results of the applica- 
tion of our hierarchical Bayesian velocity-dispersion dis- 
tribution measurement technique to the approximately 
103,000 galaxies in our LOZ sample and 330,000 galax- 
ies in our CMASS sample. 

4.1. LOZ Sample 

The LOZ sample extends to z « 0.5. The 2D contour 
plots of m and s (Figure [7]) and scatter plots in differ- 
ent redshift bins (Figure [5]) show that the mean m is 
strongly correlated with absolute magnitude, while the 
intrinsic scatter s shows no significant variation. Tracks 
of constant stella r mass assuming the L RG stellar pop- 
ulation model of iMaraston et aLr()2009D have also been 
over-plotted in Figure [3 and used to convert from an 
absolute-magnitude to a stellar-mass baseline in Fig- 
ureEl Galaxies in the LOZ sample have estimated stellar 
masses between approximately IO^^Mq and 

To quantify the variation of the m and s parameters 
with redshift and absolute magnitude, we consider a sim- 
ple model specified by: 

(31) 
(32) 

with the "0" superscript denoting the LOZ sample specif- 
ically. Performing a linear least squares fit to the indi- 
vidual bin data points, we obtain 



:AO,Mv + BS,logio(l + z)+C^ 
: AOMv + BOlogio(l + z) -HC7° 



= -0.0880 ±0.0012 
= -0.087 ±0.018 
C° = 0.37 ±0.02 



= 0.006 ±0.002 
B"^ = -0.08 ± 0.03 
C° = 0.20 ± 0.04 (33) 



We can translate the resulting scaling into the stan- 
dard form for the FJR, with luminosity L cx cr^ by rec- 
ognizing that X = — 0.4/Aj'„. The resulting value of 
X = 4.55 ± 0.06 is in reasonable agreement with the 
canonical local-universe value of a; = 4. Thus, BOSS 
low-z LRGs define an FJR whose slope and scatter has 
little dependence on redshift and luminosity; there is cor- 
respondingly little evidence for dynamical evolution in 
this sample since roughly z = 0.5. 

4.2. CMASS Sample 

The CMASS galaxy sample extends from z « 0.3 to 
z sa 0.8. The results of our a distribution parame- 
ter measurements are shown in Figures |9] and llOi once 
aga in using tracks of co nstant stellar mass based on 
the IMaraston et aD (|2009| ) population model. Using the 
same model form as used for the LOZ sample above, 

ml =Ai,Mv+B^logio(l + z)+C7i (34) 
s' - AiMv + Bl logi„(l + z) +Cl (35) 

(with the "1" superscript denoting the CMASS sample 
specifically), and again doing a linear least-squares fit, 
we find that 



In practice, we find typical values for 5m of 0.01, and for 
5s of 0.04. 



Al^ = -0.1128 ±0.0010 
S,^, = -0.77 ±0.02 
CL = -0.089 ± 0.019 



A] = 0.0263 ±0.0016 
Bl = 0.82 ± 0.04 
Cl = 0.52 ± 0.03 
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Contour Plot for mean m(Low-z LRGs) 



Contour Plot for Intrinsic Scatter s(Low-z LRGs) 
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Fig. 7. — Contour plots of m & s for LOZ sample galaxies. Top panels show the map of maximum posterior probability over the range 
of the plane with bins containing at least 100 galaxies. Middle panels show low-order bivariato model fits to these maps constructed as 
described in the text, and residuals (top minus middle) are shown in the bott om panels. Dashed lines in the top panels show tracks of 
constant stellar mass from the LRG population model of lMaraston et al] II2009I V 
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Fig. 8. — Scatter plots of m & s versus My (top two panels) and logjg (Af, /Mq) (bottom two panels) for LOZ sample galaxies in different 
redshift ranges. 

In the case of the CMASS sample, the FJR is stih 
apparent, but the scahng exponent in L oc ct^ is now 
X = 3.55 ± 0.03. This observation that the FJR be- 
comes "shallower" at higher redshift can be interpreted 
in terms of mass-dependent star-formation history (e.g., 
iCowie et all [l99l IdTSerego Ahghieri et al.l[2005f ). with 
less massive (lower a) galaxies having undergone more 
recent star formation and thus fading more rapidly with 
cosmic time relative to more massive galaxies. 

There is a clear evolution in the zero-point of the m 



versus My relation (upper left panel in Figure fTO)) with 
redshift. This evolution is essentially eliminated in the 
lower left panel of Figure [TUJ which translates to a base- 
line of constant stellar mass. Hence the evolution of the 
m versus A/y relation in the CMASS sample is consistent 
with passive stellar evolution. 

It can easily be seen from Figures l9l and [TOl that s is no 
longer constant with redshift at fixed luminosity or stel- 
lar mass. The significance of this result is encapsulated 
in the non-zero value of = 0.82±0.04 given above. To 
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Fig. 10. — The same as Figure[8]but for CMASS galaxies. (Note that the scales of these panels are expanded relative to Figure[8]) The 
increase of intrinsic scatter with redshift can be seen in the right-hand figure. 



quantify this result in more detail, we fit the s versus My 
relation with a linear model at each redshift bin, and plot 
the zero-point of this relation as a function of redshift in 
Figure [TT] We sec that within the CMASS sample, the 
intrinsic width s of the velocity-dispersion function at 
fixed magnitude or stellar mass decreases with cosmic 
time (i.e., broader distribution width at higher redshift), 
especially at redshifts z > 0.6. This is consistent with 
our tentative detection of evolution in the FJR slope be- 
tween the LOZ and CMASS samples, in the sense that 
a given range in luminosity encompasses a larger range 
of velocity dispersions at higher redshift, but the signal 



is too large to be explained by this effect alone (since 
the FJR slope is not seen to evolve significantly within 
the CMASS sample alone). We are therefore seeing in- 
creased dynamical heterogeneity at fixed luminosity in 
the CMASS sample at higher redshifts. 

We note that the apparent increase in the intrinsic 
cr distributions at high redshift cannot be explained in 
terms of surface-brightness selection effects. Through the 
FP or Kormendy relations, velocity dispersion at fixed 
luminosity is correlated with surface brightness. At the 
high-redshift end of the CMASS sample, we can expect 
a degree of incompleteness at both ends of the surface- 
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Fig. 11. — Variation of the intrinsic width s of the CMASS pop- 
ulation distribution in logj^y ct as a function of redshift. 

brightness distribution. On the one hand, relatively low 
surface-brightness galaxies will have fainter magnitudes 
within the BOSS spectroscopic fiber, and will thus be 
less likely to be targeted, and less likely to have confident 
and correct spectroscopic redshift measurements even if 
targeted. On the other hand, relatively high surface- 
brightness galaxies (again, at fixed luminosity) run the 
risk of being unresolved in star-galaxy separation. Con- 
sequently, we might expect the distribution of velocity 
dispersion at fixed magnitude to be made more narrow 
at high redshift by these considerations, which goes in 
the opposite sense to the trend we detect. 

4.3. Application to Individual Spectra 

Our results characterize the dynamical properties of 
the population of LRGs targeted by BOSS. The param- 
eters of the population can in turn be used to inform our 
estimates of the velocity dispersion values of individual 
noisy BOSS spectra. For this application, we want to use 
distribution parameters uncorrected for broadband mag- 
nitude errors, since these same errors will be present in 
the photometric data for the individual galaxies whose 
spectra we wish to analyze. 

For low-z LRGs, without magnitude error correction, 
we have 

m" = -0.0829Mv - 0.042 logio(l + z) +0.48 (36) 

0.006My-0.091ogio(l + ^) +0.22 (37) 

and for CMASS galaxies, without magnitude error cor- 
rection, we have 

TO^ =-0.0973Afy -0.60 logio(l + z) + 0.23 (38) 
0.0240M,/ +0.761ogio(l + z) + 0.49 (39) 

We can then take the posterior probability p{m, s\{d\) 
from the entire sample as a prior probability for the anal- 
ysis of an individual galaxy spectrum. The posterior 
probability for logj^g of the spectrum is then 

P(logioCT|dO cxp(d,|logiocr)p(m,s|{4) ■ (40) 

Loosely speaking, if the observational error in the veloc- 
ity dispersion measured from a single spectrum is compa- 
rable to the intrinsic width s of the particular population 
from which it is drawn, then the data and the prior will 
contribute equally to the determination of the posterior 



PDF of logj^g a. If the observational error is small, the 
effect of the prior will be correspondingly minor, while if 
the observational error is large, the posterior PDF will 
be determined primarily by the prior. 

The application of this method can thus permit a more 
precise a estimate for individual galaxies, by making use 
of the collective information about the population from 
which it was drawn. It is important however to note that 
if the spectra under consideration are somehow selected 
to be biased towards either higher or lower velocity dis- 
persions, then the prior will pull them systematically to- 
wards the population mean, giving posterior PDFs that 
are biased relative to the true a values. We must also be 
sure only to apply this method to subsamples of spectra 
that are much smaller than the population samples used 
to determine the distribution parameters. 

5. DISCUSSION & CONCLUSION 

In this paper, we have presented a new technique for 
estimating the velocity-dispersion function of LRGs from 
large numbers of low SNR spectra. This method incorpo- 
rates the effects of observational uncertainties in spectro- 
scopic redshift. velocity dispersion, and broadband mag- 
nitude. We have compared our method favorably to the 
more traditional approach of "stacking" multiple spec- 
tra; our new approach can perhaps be termed "Bayesian 
stacking" . We have also indicated how the results of our 
method can be used as informative priors to provide more 
precise estimates of the velocity dispersions of individual 
galaxies, provided that those galaxies are an unbiased 
selection from the parent distribution at their particular 
redshift and luminosity. 

We have applied our technique to a sample of 430,000 
galaxy spectra from the BOSS project of the SDSS-III, 
covering the redshift range from zero to unity, concen- 
trated between approximately z = 0.2 and z ~ 0.8. 
For the higher-redshift CMASS target sample (approx- 
imately 76% of our galaxies), we detect a highly sig- 
nificant increase in the intrinsic width of the velocity- 
dispersion distribution at higher redshifts, indicative of 
greater galaxy diversity at fixed luminosity at earlier cos- 
mic times. For the lower-redshift LOZ galaxy sample, we 
find little evolution in the velocity-dispersion distribution 
below z « 0.5. Although the CMASS and LOZ samples 
do not form a single uniform sample (LOZ galaxies being 
generally more luminous than CMASS targets over the 
range of redshift where the two overlap), our results sug- 
gest that dynamical evolution of massive LRGs is much 
more significant over the interval 0.5 < z < 1.0 as com- 
pared to < z < 0.5. 

Future applications of this method to the BOSS galaxy 
samples will focus on the effects of observational selection 
on the deduced population evolution. We also plan to di- 
vide our analysis further by rest-frame color, so as to dif- 
ferentiate between galaxies of different stellar population 
at a given redshift and magnitude. By making a more 
accurate division of the sample in terms of stellar mass 
and star-formation history, we hope to separate the sig- 
natures of dynamical and stellar-population evolutionary 
channels, and to thereby obtain a more detailed picture 
of LRG population evolution and a more powerful dis- 
criminant between theoretical scenarios. This approach 
can also determine whether the effect of increased popu- 
lation scatter in logj^Q a at high redshift is due to greater 
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dynamical diversity, greater stellar-population diversity, 
or to some combination of the two effects. 

Our measurements can also have important implica- 
tions for the statistics of gravitational lensing, by con- 
straining the total lensing cross-section in massive ellip- 
tical galaxies between redshift and 1. Although a pre- 
cise application to gravitational-lensing statistics must 
await a proper treatment of completeness, our current 
results can be combined with published luminos ity func- 
tions (e.g. JCimatti et al.ll2006tlCool et al.ll2008| ) to place 
a lower limit on the integrated lensing cross section. 

The application of hierarchical Bayesian methods such 
as the one presented here may hold the key to recon- 
ciling the tension between redshift surveys designed for 
constraining cosmological parameters and those designed 
for the study of galaxy evolution. The former goal gen- 
erally dictates an SNR just sufficient to measure red- 
shift for as many galaxies over as large a volume of the 
universe as possible, while the latter goal traditionally 
requires observations at high enough SNR to precisely 
constrain multiple physical parameters for each galaxy. 
However, if the ultimate goal of galaxy-evolution stud- 
ies is to measure the distribution of physical parameters 
within a statistically significant sample of galaxies, then 
Bayesian methods can remove the need to measure those 
parameters precisely on a galaxy-by-galaxy basis. In fact, 
there may indeed be an objective galaxy-evolution case 
for trading fewer high-SNR spectra for more low-SNR 
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